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Abstract We provide an asymptotic expansion of the maximal mean squared error 

(MSE) of the sample median to be attained on shrinking gross error neighborhoods 

about an ideal central distribution. More specifically, this expansion comes in powers 

of n"'^^, for n the sample size, and uses a shrinking rate of «"''^ as well. This refines 

corresponding results of first order asymptotics to be found in |Rieder] ( |1994[ ). 

In contrast to usual higher order asymptotics, we do not approximate distribution 

functions (or densities) in the first place, but rather expand the risk directly. 

Our results are illustrated by comparing them to the results of a simulation study and 

to numerically evaluated exact MSB's in both ideal and contaminated situation. 
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1 Motivation/introduction 

1 . 1 Simulations as starting point 

This paper was initiated by a simulation study performed by the present author and 
M. Kohl at Bayreuth university in 2003 for a presentation to be given in the frame- 
work of an invitation by S. Morgenthaler to EPF Lausanne. The goal was to investi- 
gate the finite sample behavior of procedures, which are distinguished as (first order) 
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asymptotically optimal in infinitesimal robust statistics as to maximal MSE on ^ln- 
shrinking (convex-contamination) neighborhoods. The results of this study for one 
dimensional Gaussian location were so promising already for sample sizes n down 
to about 20 that it seemed worthwhile to dig a little deeper. At closer inspection of 
the results, we realized that the approximation quality of this first order asymptotics 
could even be much enhanced down to sample sizes « = 5 and 10 if we ignored 
samples where more than half the sample stemmed from a contamination. 

Asymptotically, in our shrinking neighborhood setting, such events carry positive, 
but exponentially-fast decaying probability for any sample size. 



1.2 Description of the main result and discussion 

These empirical findings can indeed be substantiated by theory, deriving a uniform 
higher order asymptotic expansion for the MSE on correspondingly thinned out neigh- 
borhoods for the median, location M-estimators for monotone scores, and one-step- 
constructions. This paper deals with the median case. It is separated from more gen- 
eral location M-estimators, as the techniques used there are not available for the me- 
dian due to a failure of a Cramer condition. Moreover, in higher order asymptotics, 
even for the ideal model, differences appear between diverse variants of the median 
used for even sample size, a fact which to the author's knowledge has not been spelt 
out in detail so far. 

Denoting by 'ZY„(r) the neighborhoods thinned out by cutting away samples with 
more than 50% contaminations, and by M„ a suitable variant of the median, the main 
result of this paper is 

sup «[MSE(M„,G'"')] = + r^) + ^aj + iaz) + o(i) (1.1) 

with fli and 02 certain functions in r, /o,/i and, for 02, in fi, where r is the contam- 
ination radius and /] are the values of the ideal density / and its first and second 
derivatives evaluated at the ideal median. 

As a byproduct of the main result, we are able to give necessary and sufficient 
conditions for a contamination to attain the RHS of it is astonishingly small: 
all mass of the contaminating measures has essentially to be concentrated either left 
of -const -y/log(n)/n or right of const -y/log(n)/n. 

In formula we already recognize the following features of the result: 
The speed of convergence of the MSE to its asymptotic value is uniform on the whole 
(modified) neighborhood, and is one order faster in the ideal model; besides, we may 
work with the original risk (instead of using a modification as usually). 

The expansion in powers of «"'^^, in the ideal model with first correction term at 
n"' , comes surprising: Using first order von Mises expansions (compare ( |1.8| l below), 
in the context of quantiles (comprising the sample median), it can be shown by means 
of Bahadur- Kiefer representa tions that the approximatio n error of this expansion is 
an exact Ofn(n '^'*) — cf. e.g. lJureckova and Sen ( 1996 1. So one would expect that 



under uniform integrability, the first correction term in an expansion of type \\.\\ in 



the ideal model would be of order n too. In fact, Duttweiler |( |1973| showed that 
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the L2-norm of the remainder is of exact order 0(n '^'*) in our scaled up setup. These 
results are no contradiction to though, as the remainder of course is correlated 
with the asymptotic linear terms. We still do not see however how Bahadur- Kief er 
representations translate into 

In any case, the approximations of type prove very reasonable when compared 
to both numerical and simulated values of the MSE for finite n. 
With the same techniques, we deal with a number of variants of the median for even 
sample sizes, and specialize these results for the case of F = N(0, 1). For odd sample 
size, we also derive asymptotics of this kind for the variance and bias separately. 

In proving the main theorem, we use MAPLE at large to calculate tedious, lengthy 
asymptotic expansions which are hardly presentable in the framework of an article. 



1.3 Setup 

We study the accuracy of the sample median as a location estimator on shrinking 
neighborhoods: We work in an ideal location model T = {Pg \ 6 e R} with location 
parameter 6, observations X,- '~ Pg and errors m,- given by 

Xi^Ui + 0, Ui '~ F (1.2) 

Due to translation equivariance in the location model we may limit ourselves ioO -Q. 
We assume that 

f(0) = l/2 (1.3) 

i.e.; the location parameter 9 equals the median of the observation distribution, and 
that F around admits a Lebesgue density / with Taylor expansion about as 

/(x) ^h+fiX+\ fix^ + 0(^2+^0)^ y;, > (1.4) 
for some > 0. Furthermore, Finally, we assume that there is a ^ > such that 

J \xtf{x)dx<oo (1.5) 

Remark 1.1 (a) Condition )1.5) is taken from |jureckova and Sen||"l982| and is both necessary 
and sufficient for finiteness of Ef |M„|'' for any y > 0, where M„ is the sample median Med„ to odd 
sample size n, respectively any variant of the sample median considered in this paper for n even — for a 
proof see subsection |A. 1| 
(b) By the Holder-inequality, J |.v|'' F(dx) < oo for each < f; < 5, so we may assume that 5 < 1. 

We want to assess both variance and bias simultaneously, so we work with the setup 
of shrinking neighborhoods as in ,Rieder, ( ,1994^ , i.e. as deviations from the ideal 
model \\.2\ , we consider the set Q„ - Q„{r) of distributions 

G<"' 0G„,i, G„j = (1 - r/^/^)F + r/V^//,,,; (1.6) 
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for arbitrary, uncontrollable contaminating distributions //„_,. As usual, we interpret 
G*"' as the distribution of the vector (X,),<„ with components 

Xi (1 - Ui)Xf + UiXf (1.7) 

forXf, Ui, Xf stochastically independent, Xf F, Ui Bin(l,r/ -y/n), and (Xf) ~ 
Hn for some arbitrary //„ e A^i(B"). In this setup the median can be understood as 
an asymptotically linear estimator with influence curve i/'Med, allowing the expansion 

Med,, = i 2:;Li -AMed(^/) + OF-'{n-'l\ ^Med(^) = ^ (1-8) 

— c.f. |Riede7| ( |1994[ Thm. 1.5.1.)- Using a clipped version of the quadratic loss func- 
tion for the estimator S„ - Med„, 

MSEm(5„,G) := EG(min(«52,M)), (1.9) 



we may proceed as outlined in Rieder ( 1994 p. 207), and obtain 



lim lim supc„„ n [MSEM(Med„, G*"')] = i'^faTHl + r^) (1.10) 

In this paper we want to (a) examine the approximation quality of ( |1.10| l, spelling out 
higher order error terms and (b) discuss the accuracy of this approximation by com- 
paring it to both numerical evaluations of the exact MSB's and simulation results. 
In contrast to usual higher order asymptotics, instead of giving approximations to 
distribution functions (or densities) by Edgeworth expansions or using saddlepoint 
techniques — cf. e.g. [Field and Ronchetti,(1990) — we proceed by expanding the risk 
directly. 

As indicated, for (a) we need to modify the neighborhoods, admitting only such sam- 
ples where less than half of the sample is contaminated, that is 2 Ui < nil in ( |1.7| i. 
As a side effect of this modification, we will (c) get rid of the somewhat artificial, as 
statistically unmotivated, modification of the loss function by clipping ( |1.10| l, which 
is common in asymptotic statistics, see, among others, |Le Cam, ! , 1 986) , ,Rieder| ( , 1 994) l, 
[Bickel et al.| ( |T998l l, |vairder Vaart| ( |T998l l. 



1 .4 Organization of the paper 



We start with discussing the mentioned modification in detail in section |2] The cen- 
tral theoretical result. Theorem 3.2 is presented in section [3] We then present some 
ramifications in sectionlS^covering in particular several variants of the sample me- 



dian for even sample size in Theorem 3.4 and Proposition |3.5[ also corresponding 
expansions are given for bias and variance separately in Proposition 3.10| Results are 



spelt out in the special case of F — 7V(0, 1) in Corollaries 3.11 and 3.12 These theo- 
retic findings are illustrated with numerical and simulated results in section]?] In the 
appendix sectionjA] we give proofs to all our assertions. 

Remark 1.2 it took me some time to write things up in a readable fashion: In order not to slay down 
the reader with vast number of terms, in the proof section, we instead describe verbally how we got them 
referring to a corresponding MAPLE script available in the internet. To give you an idea of how tedious 
terms become, we have included a page of MAPLE output on page|24|as a horrifying example. 
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2 Modification of tiie slirinliing neiglibortiood setup 

The shrinking-neighborhood setup guarantees uniform weak convergence of any 
as. linear estimator (ALE) to corresponding normal distributions on a representative 
subclass of the neighboring distributions of form ( |1.6| l — those distributions induced 
by simple perturbations Q„{(,t), see |Rieder| (| 1994 p. 126). 

By the continuous mapping theorem, uniform weak convergence of these ALE's on 
Q„ entails uniform convergence of the risk for continuous, bounded loss functions 
like the clipped version of the MSE ( |1.9| l. However, even this (uniform) weak con- 
vergence does not entail convergence of the risk for an unbounded loss function like 
the (unmodified) MSE in general, as we show in the following proposition: 



2. 1 Convergence failure of the MSE for the median 

Proposition 2.1 Let P be the location model from \\.2\ with f{Q) > and let Med„ 
be the sample median. Then for each odd n — 2m + 1 and to any C > there is an 
xqeR such that with G^"^ = [(1 - -^)F + ^ 

MSE(Med,„Gj,"^) > C (2.1) 

although, uniformly in Q„, 

(Med„ -iZUj (Amc. dG„,i) o G("^ ^ 7V(0, (2/(0))-^) (2.2) 



2.2 Modification of the shrinking neighborhood setup 



In view of proposition 2.1 a straightforward modification for finite n consists in per- 
mitting only such realizations of Ui,. . ., U„, where K - Y^Ui < nil. More precisely, 
for < e < 1/2, we consider the neighborhood system (3„(r, e) of conditional distri- 
butions 

G^"' = X{[(1 - Ui)Tl + t/,Xf ],- 1 lim sup - y f/; < e ) (2.3) 

I „ n ^ 



If we apply the Hoeff^ding inequality ( |Hoeff^ding| ( [T963j Thm. 2)) io K ^ Y^l^ f/, for 



the switching variables f/, from we obtain 

P{K > m) < exp ( - 2n(e - -^f) (2.4) 

which shows the announced asymptotic exponential negligibility of this modification. 
Thus all results on convergence in law of the shrinking neighborhood setup are not 
aff'ected when passing from Qnir) to Q„{r): Let B„ :- [K < n/2}. Then we have for 
any 6 > and any sequence of events A„ 

P(A„ I B„) = P(A„ n B„)/P{B„) = P(A„){1 + 0(e-2«-'/(i+''>)) 
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2.3 Connection to the breakdown point 

Our definition of the neighborhood (3„(r) combines the shrinking neighborhood con- 
cept, which will eventually dominate, with a sample-wise restriction; for some num- 
ber E € (0, 1) depending on the estimator 5„, we only allow for samples where strictly 
less than E(S„)n observations are contaminated. This number e(5'„) is actually just 
the finite sample (e-contamination) breakdown point of an estimator Sn introduced 
by |Donoho and Huber| ( [T983] l. 



Thus the concept easily generaUzes from the location case to other models: Let 
V - [Pg, e 0) be a parametric model and Xf be R*^-valued observations distributed 
according to the ideal situation Pg. We are interested in the question whether for some 
given estimator 5 „, we have uniform convergence of the risk J ({S „-&) dQ„ for some 
loss a >Q on some (thinned out) neighborhood or not. To this end we define Q„(r, s) 
analogously to ( |2.3| l. Assume that there is some e > such that for each n e N and 
k<k:- - 1 

so{S„) := inf {e*(X„_i,5„) | = (xi x„_i) a possible sample 

configuration, k < k\ > e > 

where e*{X,S) is the finite sample (e-contamination) breakdown point of S at sample 



X. Then, by an analogue argument to that of Proposition 2.1 the following proposi- 
tion holds: 

Proposition 2.2 Assume that I is unbounded. Then for any e > bq{S „) and r > 0, the 
maximal risk of S „ on Q„(r,E) is unbounded; in particular, uniform convergence of 
the risks does not hold. 

The other direction of this connection is more involved and is deferred to a sub- 
sequent paper. Under sUght additional assumptions, for suitably constructed ALEs to 
bounded influence curves and for continuous, polynomially growing loss functions, 
uniform convergence of the risk holds on (3„(r, e) for any s < s. Note that this thin- 
ning out for continuous loss functions £ is not needed if £ is bounded. 



3 Higher order asmyptotics for tlie MSE of the sample median 

For H e A1i(B") and an ordered set of indices / = (1 < /i < . . . < < n) denote Hj 
the marginal of H with respect to /. 

Definition 3.1 Consider three sequences c„, d,,, and Kn in R, in (0, oo), and /«{!,...,«), 
respectively. We say that the sequence (//*"') C M\{B>") is /<-„-concentrated left [right] 
of c„ up to o(d„), if for each sequence of ordered sets I„ of cardinality i„ < k„ 

l_//(«)((_cx,;c„]'") = 0(fl'„) [l-//f((c„,cx,y") = o(fl'„)] (3.1) 
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3.1 Main theorem 



Theorem 3.2 (a) In the location model ( |1.2[ ) with ideal central distribution F of 
finite Fisher information of location, we assume conditions ( |1.4| l to \l.5\ . Then for 
any e < 1/2, for G^"^ varying in Qn(r, s) of ( |2.3| l it holds 



sup n[MSE(Med,„G<"')] = — r(l +rH — a, + -aj + o(l/n)) (3.2) 
CM 4/2 V" n 



/or 



fli =2(1+^2)+— (3.3) 

. o . . 2 , . 4^ , 3r2(3 + r2) |/i| 3 + 6r^ + r\ /2 ^ 

(22 = (-2 + 3r + 3r ) + r- — + 

2 /2 12 /3 

> 

1 



5(3 + 6r^ + r^) ff 



(3.4) 



(b) 77ze maximal contamination is achieved by any sequence of contaminating 
measures (H„), such that for ki > 1 and k2 > V5/2, and for k„ — ^k\ryfn ^, eventu- 
ally in n, either 



(H„) is K,i— concentrated left of—-^ -\J\og(n)/n up to o(n ') (3.5) 

or 

(H„) is K„-concent rated right of ^ yjlog(n)/n up to o(n"') (3.6) 

More precisely, if f < [ f > 0], the maximal MSB is achieved up to 0(n"^) by 
contaminations according to ( |3.5| l /^( |3.6| l ], and according to either of the two if f — 0. 

Remark 3.3 (a) This result of course also covers the ideal model (r = 0), and is also relevant for 
the fixed neighborhood approach: If for fixed n, we formally plug in r = s V« (for .v small in comparison 
to V« ) this gives a corresponding result for the maximal MSE of the sample median on a neighborhood 
of fixed size s. ("formal", as we cannot control the remainder for arbitrary s < 1.) 

(b) If one is only interested in the behavior of « MSE up to order o(n"''^), one may weaken assumption 
\\A) to: For some 6 > 0, 

/W = /o+/i-v + 0(.v'+''), /o>0 (3.7) 

(c) Conditions \3.5\ and \3.6) imply that it is sufficient to contaminate F" by measures H„ the one 
dimensional marginals of which are either concentrated right of C ^Jlog(n)/n or left of -C ^Jlog{n)/n for 
some constant C > in order to obtain a maximal MSE — an astonishingly modest contamination! With 
respect to {1.8) , this is plausible however, as \ipMcd\ attains its maximal value for any x 0. 

The thinning out of the marginals by means of DefintionjxT] even tells us that of the n potentially con- 
taminating X* only all subsets of cardinality roughly Vn need to be "large" at all, the remaining coset (of 
cardinality order n(l + o(l))) of contaminations might even stem from the ideal situation! 
As shown in Proposition |3.9| conditions \3.5) resp. \3.6\ are almost necessary. 

(d) Th e sample median for odd sample size as well as all variants of the median considered in Propo- 
sition 34lcome up with the same leading term (1 + r-)/(4/jj) for n MSE — according to first order asymp- 
totics (l.IO) (with modified loss there!). 

(e) In all variants of the sample median considered in Theorem 1 3. 2 [ and Proposition |3.4| the second 
order correction is positive, so that for any r > we eventually underestimate the MSE by first order 
asymptotics. 
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3.2 Ramifications 



As simulations in section 4.2 were made for even sample size, we present an analogue 
to Theorem 3.2 for even sample size below. As there are infinitely many sample 
medians for even sample size, we consider the following variants: 

- the order statistics X[m„] 

- the order statistics X[(,„+i):„] 

- the randomized estimator M,' := UX[„r.n] + (1 - U)Xn,„+i)-„] with some random- 
ization U ~ Bin(l, 1/2) 

- the midpoint-estimator M„ := (X[,„.„] + X[(,„+i).„])/2 

- the bias corrected estimator M'„' :- (X[m-„] + j^) 



Proposition 3.4 Under the assumptions of Theorem 3.2 for even sample size n — 
2m, for the sample median variants ^[(m+i):«], M'^, M„, M'^, here denoted by 

Mn generically, for any e < 1 12, for G^"^ varying in Q„(r, s) of ( |2.3| l it holds 

sup n [MSE(M„, G^">)] ^l^i(l+r'-)+ ^(ai.o + flu |) + 



+ 



i(fl2,0 + fl2,l| +fl2,2| +fl2,3|)| +o(^) (3.8) 



for some real numbers aij = aij(Mn) which are given in detail in Proposition 3.5 



In any variant, the maximal contamination is achieved by contaminating measures 
H„ according to either condition p.5[ ) or p.6| l where the distinction between these 
two is made as in the case of odd sample size. 

Proposition 3.5 [Specification of the tenuis Uij] Splitting up 02,0, fl2,i> fl2,2 according 
to 

fl2,0 = fl2,0,'' + O2,0,c, fl2,l - 0.2,\,r + fl2,l,c, fl2,2 = fl2,2,r + fl2,2,c (3.9) 

we get 

(a) Identical terms for all variants.- 

02,3 = -^^6 fl2,2,c = -1/4, a2,2,r = ' n (i.lO) 



(b) Varying terms in the ideal model; 



aifiAM';) = -2, fl2,o,c(M„) = -3 

a2,0,c(^[m:n]) - fl2,0,c(^[(m+l):n]) - a2fi,c(M'„) 



(c) Remaining fl, y for M,„ M'^, and M,". 



fli,o(M;') = fli,o(M;) = fli,o(M„) = 2(1 + r\ 
flu(M;') = fli,i(M;) = fli,,(M„) = (r2 + 3)sign(/i)/2, 



(3.11) 



(3.12) 



«2,o,.(M;) = fl2,o,.(M„) = 3/ + 3r2 = fl2,o,r(M;') - 2r2 sign(/i) (3.13) 
a2,iAM'„) = fl2,i,c(M„) = 0, fl2,i,c(M;') = 1, 

n (M'\-n (M \- 3'-'(3+>-')sign(/i) _ 2 (3.14) 

«2,l,r(™„) - fl2,l,/(Mn) - 2 " fl2,l,)-(M„ ) - r 
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(d) Remaining a,-,; for X[,„;„] and X[(„,+i):„] ; 

a2,],c(^lm:n]) = 3/2 = -fl2,l,c(^[m:n]) (3.15) 

For X[„,:„] ant/ X[(m+i):„], condition ^3.5\ [ ^3.6\ ] applies if A-f^ > [<] - (3 + r^)/i 
Correspondingly, let 

1 for X[„,:„] 



-l/orX| 



[(m+l):«] 



(3.16) 



and 



(3.18) 



i = sign((3 + rVi + ^'4/o ) (3.17) 
Then the remaining ai j for X[„r.„] and X^m+iy.n] are given by 

ai,o = 2 + 2s' s + 2r^, 02,1,,- = 3i'i((3 + i)r2 + r'*)/2 
aiAr = 3r'* + (3 + 4.0'-2 

In case 4/q = i'(3 + r^)fi, both condition ^3.5) and p.6[ ) m/7 fo o(n"^) lead to the 
same MSB. 

Remark 3.6 in case of the sample median for odd sample size, 

. \ ('""+3)sign(/i) ^ 

ai.o = 2(1 oi.i = '^ZAc = "2, 

2,04 A 3r^(3+;-^) sign(/"i ) 

«2,o,r = 3' + 3r, a2,i,c = 0, 02,1,;- = — ^ — 2 ' 

a2,2,c - "4- '^2,2,r [2—. ''2.3 - 

Remark 3.7 it is a well-known consequence of the Jensen inequality that convexity of both loss and 
admitted estimation (or more generally decision) domain entails that randomization cannot improve an 
averaged estimator, compare e.g. |Witting|jl985| (1.2.98), p. 52). This is reflected by the fact that in both 
ideal and contaminated situation, M„ up to o(l/«^) has a smaller MSE than M,' — the only difference 
arising in term 02.0,1; ■ 

Remark 3.8 in the ideal model, as shown in jCabrera et al.||"l994| Theorem 1), one even has the pecu- 
liarity that, in our notation 



1 



MSE(M2„„f)-MSE(M2„,+i,F) = -— — +o(m-^) (3.19) 

16m /g- 

that is, evaluating the sample median at one more observation (from 2m to 2m + 1) deteriorates MSE! As 
our expansion already stops at o(fn"^), we cannot reproduce j3.19) to the given exactitude by means of our 
representations j3.2) and 

After correcting (minor) typing errors in formulae (2.2), (2.5), and (2.6) in the cited reference, we obtain 
\i.2) and jTsJ from (2.2) again; for details refer to the web-page to this article. 



Conditions (|3.5|l / (|3.6|l almost characterize the risk-maximizing contaminations: 



Proposition 3.9 Under the assumptions of Theorem 3.2 let do. Assume that, for K = 
Z'U Ui andk>{\ -5)ryfR, 

n 

Pr ( 2 Ui I(Xf < Vlog(«)/«/(2/o)) >l\K^k)>pa>Q (3.20) 
1=1 

Then, eventually in n, no such sequence of contaminations G^"^ € Q{r), can attain the 
maximal MSE in \3.2) as in condition \3.6) (i.e. with positive bias). More precisely, 

sup n [MSE(M„, G*"')] - n [MSE(M„, G^^"^)] > f^" ^ + o(l/«) (3.21) 



c<") 2n /o V27r 

A corresponding relation holds for condition (|3.5 
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With the same techniques we can also specify which parts of the MSE — up to order 
— are due to variance and which are due to bias; to this end let M„ be the sample 
median and the midpoint estimator M„ for odd resp. even sample size. 

Proposition 3.10 In the situation of Theorem . 
spelt out in p.5| l, p.6| l, leading to G["' in ( |2.3 i 



3.2 for contaminating measures H„ as 



n[Var(M„,G[;'0] = -7^ 

+ i(3.^-(5-(-I)")/2+^ 

|Bias(M„,G[,"')| = 



1 + 



1 

Wo 



it holds 



-\Jn 



r + 



l/iKr+i) 



) +o(i) (3.22) 



h 



(3.23) 



«[Bias^(M„,G[,">)] = + ^(2.^ + M^) + 

+ mT^ — — + (3.24) 

We next specialize Theorem 1 3. 2 1 and Proposition |3 .4| for the case of F - W(0, 1) for 
later comparison to numeric and simulated values. 

Corollary 3.11 In the location model about F - N(Q, 1), 

sup n [MSE(M„,G("')] = ^|(1 + r^) + f^a.^ + i(fl2,o + 27rfl2,2)| + o(i)(3.25) 

Corollary 3.12 In the location model about F — N(0, 1), in the ideal model 

«MSE(Med„,F) = ^[1 + (^ +fl2„,)/„] +o(i), (3.26) 

As numerical evaluation of p.26| l, we get in the three cases: 

(1.5708(1 - 0.4292/n) /orMed,,,^;' 
«MSE(Med,„F) = o(i) + 1 1.5708(1 + 0.5708/n) /orX,„™,,X[,„„„„,M;, (3.27) 

[l.5708(l - 1.4292/n)/o, M„ 

This means: We overestimate MSE(Med„, F) by the first order asymptotics for odd 
sample size n and with estimator M'^', and to an even higher degree, if we use M„. 
The risk of estimators X[m-„], X[(„,+i):„], M'„ however is underestimated. 



4 Illustration of the results 

To illustrate the approximation, we consider the case of F - N{Q, 1) with a number 
of numerical evaluations and a small simulation study. 
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num. 




error of asymptotics 






n 


exact 


lst/2nd order 


3rd order 




Var;;' 


absolute 


relative 


absolute 




rcl stive 








M„ 








5 


1 .4341 


1.366E- 1 


9.527 % 


1.790E- 


3 


0.125% 


1 1 


l.jUoo 


6.201 E - 2 


4.110% 


7.194E- 


4 


0.048 % 


101 


1.5641 


6.687 E- 3 


0.428 % 


1.174E- 


5 


0.001 % 






V 

^[„/2:n] 








O 


1 . /ZlU 


— 1 502 E — 1 


-8.728 % 


-7.715E- 


4 


—0 044 % 


10 


1 .6610 


-9.022E-2 


-5.431 % 


-5.560 E- 


4 


-0.033 % 


100 


1.5798 


-8.976E- 3 


-0.568 % 


-9.445 E - 


6 


-0.001 % 


6 


1.4776 


9.320E-2 


6.307 % 


-1.917E- 


2 


-1.297% 


10 


1.5106 


6.019E-2 


3.984 % 


-7.233 E - 


3 


-0.479 % 


100 


1.5641 


6.665 E - 3 


0.426 % 
M„ 


-7.681 E - 


5 


-0.005 % 


6 


1.2884 


-2.823 E- 1 


-21.913% 


-9.182E- 


2 


-7.126% 


10 


1.3832 


-1.875E- 1 


-13.557% 


-3.697 E- 


2 


-2.672 % 


100 


1.5488 


-2.200E-2 


-1.421 % 


-4.472 E- 


4 


-0.029 % 



Table 1 Accuracy of the asymptotics in the ideal model 



4. 1 Numerical Results in the ideal model 

In the ideal model, we have evaluated the integrals numerically, using formulas for 
the densities in the ideal model to be derived later in section |A] g„ for the sample 
median for odd sample size from \A.5\ and g„ for the midpoint estimator for even 
sample size from ( |A.48| l. For the numerical calculations, we have used R 2.11.0. 
Note that the hmit up to five digits in this case is 1.5708. Further sample sizes are 
available on the web-page to this article. 



4.2 A simulation study 
4.2.1 Simulation design 



Under R 2.11.0, compare R Development Core Team (2010 1, we simulated M 



10000 runs of sample size n - 5, 10, 30, 100 in the ideal location model V - N(6, 1) 
at = 0. In a contaminated situation, we used observations stemming from 

G?' = £{[(1 - UdX't + UiXri 12^'- ^"/^^ " ^ ' 

for Ui Bin(l,r/ V"), ^i' '~ MO, 1), X""' Inoo) all stochastically independent 
and for contamination radii r = 0. 1 , 0.5, 1 .0. Further results for n - 30, 50 and/or r - 
0.25, 0.5 are available on the web-page to this article. With respect to Remark [33] ([c|, 
the contamination point 100 will largely suffice to attain the maximal MSB on Q„. 
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n 


r 


sim 


[low; 


up] 


num 




-1/2 


,1-1 


n = 5 


0.00 


1.423 


1.384 


1.464] 


1.434 


1.571 


1.571 


1.436 




0.10 


1.652 


1.602 


1.701] 


1.671 


1.587 


1.728 


1.613 




0.50 


3.014 


2.917 


3.111] 


3.045 


1.963 


2.842 


3.258 




1.00 


4.525 


4.394 


4.655 ] 


4.509 


3.142 


5.952 


8.853 


n = 10 


0.00 


1.371 


1.333 


1.410] 


1.383 


1.571 


1.571 


1.346 




0.10 


1.534 


1.491 


1.578] 


1.521 


1.587 


1.687 


1.472 




0.50 


2.980 


2.882 


3.078 ] 


2.916 


1.963 


2.584 


2.636 




1.00 


5.723 


5.568 


5.879] 


5.735 


3.142 


5.129 


6.422 


n = 30 


0.00 


1.518 


1.476 


1.560] 


1.501 


1.571 


1.571 


1.496 




0.10 


1.614 


1.569 


1.659] 


1.579 


1.587 


1.644 


1.573 




0.50 


2.400 


2.331 


2.469 ] 


2.390 


1.963 


2.322 


2.339 




1.00 


5.391 


5.245 


5.538] 


5.255 


3.142 


4.289 


4.720 


n = 100 


0.00 


1.546 


1.503 


1.589] 


1.549 


1.571 


1.571 


1.548 




0.10 


1.585 


1.541 


1.629] 


1.597 


1.587 


1.618 


1.597 




0.50 


2.165 


2.106 


2.223 ] 


2.171 


1.963 


2.160 


2.165 




1.00 


4.010 


3.911 


4.108] 


3.952 


3.142 


3.770 


3.899 



Table 2 Asymptotics compared to numerical and simulational evaluations 



rel.err 


order 


r = 0.00 


r = 0.10 


r = 0.25 


r = 0.50 


r= 1.00 


1% 


1st order asy. 


143 


320 


2449 


10016 


40127 




2nd order asy. 


143 


133 


85 


124 


479 




3rd order asy. 


17 


17 


25 


48 


124 


5% 


1st order asy. 


29 


9 


92 


406 


1629 




2nd order asy. 


29 


25 


10 


30 


101 




3rd order asy. 


7 


9 


11 


20 


46 



Table 3 Minimal no s.t. for n > no the relative error using first to third order asymptotics for approximat- 
ing maxMSE(Med„) on Q„(r,E) is smaller than 1% resp. 5% 



4.2.2 Results 

The simulated results for n MSE(Med„, G) ) come with an asymptotic 95%-confidence 
interval, which is based on the CLT for the variable 

empMSE„ = 2:,.[Med„(samplep]2 (4.1) 

We compare these results to the corresponding numerical "exact" values and to the 
asymptotical values for approximation order rfi, n '^^, n ' respectively. For even n we 
take the midpoint-estimator which is the default procedure in R. For the numerical 
evaluations we use density formulas from section |A] ^ for odd sample size from 
( |A.9| l and the integrand from ( |A.50| l for even sample size. 

For the ideal situation we had simulation results available for all runs to r 0, so the actual sample size 
for r = is 40000. 



4.3 Discussion 

The numerical results of subsection |4. 1 1 show an excellent approximation quality of 
our formulas ( |3.2| ) and ( |3.8| l with specifications ( |3.9| l to ( |3.14| i in the ideal model. In 
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maximal MSE tor Median 



r=0.00 r.0.10 




20 40 60 80 100 20 40 60 80 100 



lirst order asymptotics 
+ second order asymptotics 
x ttiird order asymptotics 



Fig. 1 The mapping n i-> rel.error{maxMSE(Med„)) for F = N(0, 1). 



particular the different under/over-estimation properties of the different median vari- 
ants are closely reflected by the numerical results. The approximation quality of the 
midpoint estimator indicated in p.27[ ) is somewhat less well supported by the numer- 
ical results, which is probably due to the fact, that by iterated numerical integration 
the accuracy of the numerical approximation will be inferior to the other variants. 
In the contaminated situation, empirical and numerical results also strongly support 
our assertion of a good approximation quality down to moderate to very small sample 
sizes, as long as the contamination radius r is not too large: For n - 5 upto radius 
r = 0.1, for n - \Q (almost) upto r - 0.25, for n > 30 upto r = 0.5, all approxima- 
tions up to o(«"')-terms stay within an (empirical) 95%-confidence interval around 
the (empirical) MSE (multiplied by «). 

In any case, higher order asymptotics yield more accurate approximations than first 
order ones, and upto case n - 5, the 1/n-terms improve the approximation with re- 
spect to the l/«'^^-terms. 

A closer look is provided by figure [T] (and, zooming in for « > 16, there is an addi- 
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tional figure on the web-page). Indeed for all investigated radii r — 0, 0.10, 0.25, 1.00, 
the relative error of our asymptotic formula w.r.t. the corresponding numeric figures 
is quickly decreasing in absolute value in «; also, we notice a certain oscillation be- 
tween odd and even sample sizes induced by the different definitions of the sample 
median in these cases. In table |3] we have determined the smallest sample size no 
such that for n > «o the relative error using first to third order asymptotics for ap- 
proximating maxMSE(Med„) on Q„ir) is smaller than 1% resp. 5% which shows that 
for r < 0.5 we need no more than 20 (50) observations to stay within an error corridor 
of 5% (1%) in third order asymptotics. For first order asymptotics, however we need 
considerable sample sizes for reasonable approximations unless the radius is rather 
small. 



A Proofs 

A.l Proof of RemarkfTTta) 



Let n = 2m + 1 and y e (0, 1). Necessity: With F = 1 - F, by integration by parts and Holder inequality to 



exponent m + 1, we obtain that for any T > 0, and some constants K',K > and a - 



IMedJ'' = Ji Ixl^ Fix)'" Fix)'" F(dx) > 



Jureckova 



;"(!:)/' 

>A:max( r x^-' Fi-x)""^' dx, f .y''-'F(a:)'"+' t/.v) > A"( f \xf F{dx)f*^ 

Sufficiency: Under condition jT3J, for gs(t) = \t\^F{t)F(t) and gs := sup, gs(t) it holds — cf. 
land Sen|(T982| (2.37)) 

|a<oo, limg6(t) = 0, h:= \ [F{t)F{t)f dt < oa > Ijd (A.l) 

|/Hoo J 

Hence for any n > 1 + lyjS, it follows h = m + (\ - y)IS > IjS and hence 

Ef IMed,,!'' = nyp"'] J x''-^[F"'+\x)F"'(x) + F'"(-x)F'"*\-x)]dx < 



The arguments for even sample size are similar. 



A.2 Proof of Proposition |2T| 

The assertion for uniform normality follows along the lines of |Rieder]jl994| Theorem 6.2.8): Although the 
assumed uniform Lipschitz continuity of the scores tp — (68), p. 23 1 in the cited reference — fails, a look 
into the proof of the theorem shows that this condition only is needed to achieve conclusion dL(6) = Ij. on 
p. 235, which in our situation is the case anyway. 

Assertion \2 \) is shown by a breakdown-point argument: We interpret G*"* according to jl.7) , where for 
this proof '~ \xf,\- observe that Med„ > jcq surely under G*"' as soon as K = the number of 

observations stemming from I|.vo|, is larger than m. But, K being a binomial vaiiable, the event {K > m] 
carries positive probability p„. So setting xq := ^jCj p„ , we get 

MSE(Med„,G|,"') = E^.oCMed^) > Eg,„,(Med2 1|^>,„|) > xlp„ = C □ 
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A. 3 Outline of the proof of Theorem|3.2| 



As in the theorem we define n = 2m + 1 and first consider the situation knowing that exactly K = YiUj = k 
observations have been contaminated, to values say jfi, . . . ,Jf,t. More specifically, it will be sufficient to 
consider — for each fixed t — the number 



j = k(t) := #{.v,- : Xi > t] 



(A.2) 



In this situation we will derive the (conditional) probability that the (unique) median Med„ is not larger 
than t and derive its density. We then fix some ki > I and k2 > V5/2 and split up the proof according to 
the following tableau 





K <kiryln 


k\r^i < K < pn 


K> pn 


\t\ < k2 Vlog(«)/n//o 


(I) 


(III) 




k2 Vlog(n)/n//o < |/| < 


(11) 


excluded 


\t\ > 


(IV) 





For cases (11) to (IV), we will show that they contribute only terms of order o(n"') to « MSE(Med„) 
and hence can be neglected. Applying Taylor expansions at large, we derive an expression in which it 
becomes clear, that independently from t and eventually in n, the maximal MSE is attained for 7^(0 either 
identically k or identically for all / in (1) — or equivalently all .v, are either smaller than - ^ -^log(n)/n or 
larger than j- -^\og(n)/n. Integrating out first f and then k we obtain the result. 



A. 4 X(Med„) in ideal and contaniinated situation 
A.4.1 Ideal Situation 

Lemma A.l LetXj '~ P real-valued random variables. Then 
If dp = p dA, then Xyt„] has density 



(A.3) 



g(f) = «/'(f)(^_ %(tf-\l - P{t)f-<' 



(A.4) 



In particular the density of the sample median for odd sample size n = 2m+ 1 in the situation ofTheorem \3.2\ 
is 

I2m\ 

g„{t) = n/(0 F(0"(1 - F(t)r (A.5) 



Proof The proof is standard, but as we will need some terms later, we pass thi'ough the main steps here: 
For fixed f e R we introduce 7, := I{x,<(}- Then the following events are identical 



{X[t„] <t] = [#i : IXi < t] > k] = {J]y, > k] 



(A.6) 



The fact that F, ~ Bin( 1 , P{t)) entails jA3j. {AA) follows by simple differentiating, \A.5) by plugging in 
k = m + I. □ 
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A.4.2 Contaminated situation 

By jO), X, = (1 - Ui)Xf + UiXf, and thus fixing again t e R, also 

y,- = (1 - C/,)F;'' + f/,Ff (A.7) 

with correspondingly defined variables. As we sum up the F, in jA.6) . only S„ = 2 5'i will matter. As 
indicated in the outline, we split up the event {5„ > in] by realizations of K, and in the section [K = k] we 
may suggestively write S„ = S'^_f. + Sf, giving 

{Med,, <t] = OtliStk + > ml n = 4) 
Splitting up again this event by the realizations of 5*, we get 

{Med,, <t\ = U^LoU;-ol5;^t >m-j]n {Sf = j] n[K = k] (A.8) 

Thus, for the moment, we may consider the situation that exactly k observations, < k < m, are contam- 
inated, and exactly j = ji,(t) of the contaminated observations are larger than t and denote that event with 
Dj,kj- As {X[^„_y.„_j.] ^ 'I is independent from Djj,j, with F = I - F, the the conditional density of Med,, 
knowing is 

g..j.k{t) ■■= {n - kf"" ~ %tr'-JFitrJ-''f{t) (A.9) 
\m- J I 

Thus abbreviating again by j, we get the following representation 

ni k ^ 

«MSE(Med„,G<"') = « E I ^^^t = = ^) (^10) 

k=a j=o 



A. 5 Auxiliary results 

Before starting with the results we need some preparations 



A.5.1 Stirling approximations 

We start with writing down some approximations for the factorials and the binomial coefficients derived 
from the Stirling formula to be found e.g. in Abramowitz and Stegun ( 1984 6.1.37): 

= (s&)'"^(jrR V («.Ax*-;)2. (l +p„,,,), for - ^ - 4^ < p„,.k < Ti;(A.ll) 

= (|T)'"'(^)""'"'V5^TS&(^-ffi+°(^^ for;,/: = 0(V^), (A.12) 
The next lemma will be needed to settle case (III): 
Lemma A.2 Let 

K := k,\ogk[ + I -ki (A.13) 

Tlten it holds that 

Pr(Bin(n, r/ ^/n) > kir^/n) < exp( - at r + o( )) (A. 14) 

Proof We first note that k > 0, as log(A:) > for jc > 1 and *■ = J*' log(jc) dx. By Hoeffding's inequality 
jHoeffding||1963| Thm. 1, inequality (2.1)), we have for f,, = 1, n i.i.d. real-valued random variables, 
<M,i.i = E[f i] and < e < 1 - 

P(li:.^,-,>-e)^[[^r{^t^-]" (A.15) 
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Applying jA.lS^ to the case of h independent Bin(l,r/ 'Jn ) variables, we obtain for B„ ~ Bin(H, r/ VfT) 
and < £ = (ii - l)r/ V« < 1 - r/ ViT : 

Pr(B„ > «:irVn) < exp(-<:irVn log(*:i) + (n - /:ir V" )( log(l - ^) - log(l - <:i -^))) 

For .Y e (0, 1), -jr^ < log(l - x) < -.v. Thus the difference of the logarithms is smaller than {k\r)l(^(\ - 
k[ rj ■\fii )) - r/ V" and 

Pr(B„ > ki r V« ) < exp (- Kr V« + o( Vn )) 



Corollary A.3 Let X ~ Bin(n, r/ V«)- Then for each i e No 

[Al4| 

Proof E[y I|x>i-, , ys) ] ^ «' Pr(X > kir^fn) const «' exp(-K r V«) □ 
Lemma A.4 We have that for j, k = Oi -Jn) 

Proof Using the fact that j, k = 0( Vn ), we note that 

Si = l/2+*5^ + 1;#^+o(„-i) (A.18) 
By (14), (O), F(0 = 1/2 + /or + o(0; thus 1^ - F{t)\ = | 0(^ ) - fot\. □ 

Lemma A.5 Let X ~ Bin(ji, p). Then, for p = r/ Vn. 

E[X] = rn"^, E[X^] = i^n + m"- - r, (A.19) 
E[A:'] = r^n^"*/- + 3r^n + (r- 3r^)n"^ - 3r^ + 2r^«"'/% (A.20) 
E[X''] = r^n^ + 6i^n^'^ + (7r^ - 6r*)n + (r - l&r^)n^'^ + +llr'' - 7f^ + 12r^r"- - er^n"' (A.21) 

Proof Cf. the MAPLE-procedure Binmoment on the web-page. P 

Finally, we note the following Lemma for A^(0, 1 ) variables 
Lemma A.6 LetX ~ M(0,l). Then for k e 1^ and any c> V2, 

E[|X|*I„^, -J = o(n"') (A.22) 

Proof Let 0(x) := Pr(X < x), := I - 0, (fix) the density of Z. Then 

[ Pk(x) if(x)\ for k odd 

ErZ* I 1 = •! ic-yiogCii) 

|X>. Vtoiwi ] p^(;t) + Yti]ai - mixf , k even 

for some polynomial of degree k-l. The assertion follows, as ip(c ^\og(n)) = ip(Q)n^'^~^^ = i^(0)n"''^''* 
for some 6 > Q, and because for the 0(jr)-term, 0(x) < if{x)/x for .r > 0. □ 
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A.6 Proof for odd sample size 

We recall the density g„j,it from (Aj9j: 

g..j.M ■■= (n - k f"" " %tr'-JF{tr^-''f{t) 
\m- J I 



So the integrand of interest is n f gn,j,k(.t)- Applying the Stirling approximation jA.12) to the constants, we 

get 

2m -k\ ,2m-k\m-j, 2m -k sm-k+j 

.] = ( r) I r—.j y„j,k (A.23) 

J I ^ m — J ' ^m — k + j' 

with 

y".J.t ■■= A/ (...+;-q(»i;)2:. (l (^-24) 

for Pmj.k from )A. 12) . As F(t)"'^^ F(t)"'*'^'' suggests an asymptotic decay, we will expand gnj,k at the 
mode of F(t)"'^' F(t)"'*^^^ . Differentiating, we easily get that 

,„ ;- ^ f m — i \m-i^m+ i — k\m+J-k 

F(0"-FW--',(^) (^-^) (A.25) 
with equality iff f = x„j^k for 

Introducing 

we see that 

gn.j.k{t) = (n- k)y„,,,km[\ + ^AF„,j,kr-'[\ - ^ zJF„,,,t]"'+J-* (A.28) 



■■= (A-26) 



'^F„,j,k := Fit) - S = F(t) - F(Xn,j,k), (A.27) 



Case (III): For k e [kir^ln,En\, < j < k, we partition the terms according to jA.28) and see that on 
\t\ < n^, the integrand nfi g,^ jk(t) multiplied by rc"^ is olrP) for each f ixed t and is dominated by /(f) 
and hence by dominated convergence tends to as n — > oo. But Lemma [A.2|yields that Vv(K > k\r^) 



decays exponentially in n, hence is even o(n *), so as noted, (111) is indeed negligible asymptotically to 
order o(«"'). 

k 



A.4 



Case (II): Here k <k\r yfn and \t\> ^ -^\og{n)/n, or equivalently by Lemma j 

\AF„_j,k\ > k2 Vlog(n)/n (A.29) 
Now for X > 0. log(l + x) < X and for < jc < 1, log(l - x) < -x - x- 12. Hence, we obtain eventually in n 



A.12 

^9 



gn.imim <in~ k)y„,j,k exp(- ^^^^zlf ^^.p W (n-k) ^/^^^/^(l + iS;) ^1 log(m)] (A.30) 
Plugging in that m - k/2 > m - k^r yfmj2 in (11), we get 

gn.j.k(t)/f{t) < constm^*2(l + o(«'')) = o(n-'-) (A.31) 
where the last equality is a consequence of k2 > V5/2. So negligibility follows by dominated convergence. 

Case (IV): We only treat the case t > n^; a con'esponding relation holds for t < —r?. Under \23) , for n 
lai-ge enough, we obtain bound g„j^k < n2'"/(0F(0'*'"^^'""'"^- Let 7]=\/2-E,b = 2/6 and 6' e (0, 1). 
By choosing n large enough, we may achieve that F(n^) =: A„ < 2"'''' and F{r? f'^ > \ — 6' . So by jA.l) , 
we get eventually in n and for some constant c and any if > 0, and gs from the proof of Remark[l.l[a) 



1/2-2* 



n [ t^g,uj.k(t)dt< -^2" f [t'F(t)F{t)]''[F(t)mtdtAT-^'' 

-^i^^(24)" = cexp(-|logi„l4'?- ^ - ^ - ^]) ^ exp(-,'„) 
(1 - S')A„' 
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Case (I): Here we restrict ourselves to the case that 

k<h^lJ,r, I - F(t)\ < k2 Vlog(«)/« (A.32) 

Doing so, we set u := 1 - ,t„ y^. As on (1), k = 0{ V«) as well as we make this magnitude explicit to 
MAPLE in the function transf by introducing the bounded variables 

k:=k/^/m and J := (<:/2 - ;)/ Vm (A.33) 

This gives the expansion in powers of m"''^ 

{m- j)l(2m-k)= 1/2 + 7/(4 Vm) + ^//(Sm) + o(«"') (A.34) 

Thus, to get an approximation to x„jj( = F'(jI^), we expand this in a Taylor series in powers of m"''^ 
(compare our MAPLE-procedure asquantile) which gives 

Furthermore, 

f(x„.j,k) = fo- ^(2/o«i'/2) + ( - fff + Ifjljk + /2/o;')/(8/3m) + o(l/«) 
which implies that in (1), by jA32j, u lies in a shrinking compact, as 

u = F-'(F(0) - f-'(F(^,,M)) = f(x„.j.kr'iFit) - ^) + o( Vlog(«)/«) = 0( ^log(n)/n). 
Setting AF,^ jit := F(t) - F(x„ji^), and expanding this in a Taylor series around 0, we get 

^^n.;.* = fou + fl («^/2 + M.V„j.t) + /2(n'/6 + (m^A-„j-.«. + UX^„ j f,)/2) + o(«"^'^) 



and 



f(t) = /o + /i(« + x„j.i) + /2((« + x„j,i,)f/2 + o(n ') 



We turn to the constant factors now; up to now, the terms arising by applications of the Stirling formulas 
of subsection |A.5. 1 [ come with i:-terms in the nominators. As we want to integrate over K later, however, 
it is preferable to move these terms into the denominators by Taylor approximations — here performed by 
the functions asympt and collect in MAPLE (compare our function asbinom): 

n(n - k) ^y„,j,t = 2 i m^'^ [ i - + ] + o(« i ) (A.36) 

Spfi! + <25tfii = 4m(l - ijl + ^iM) + o(„) (A.37) 

2{m-j) 2{m+j-k) v„,l/2 m-'^-' 

Next we expand [1 + ^/IF„j^kr'JU - /IF„.;.;t]"'+^"*: We plug in {A35) , set 

D-2 := Hmfl y := »o-„ (A.39) 
and apply the Taylor expansion exp(j:) = 1 + jc + j:^/2 + o(,t^). This gives 

[1 + a£2^^f„_^.,]--./[i _ zlF„.,.t]'"+^-* = exp(-//2) h(y, J, k, n) + o(«-l) 

with 



hiy 



y,},k,n) = 1 + ((I - ^) / - ^ V2y3)m-'/2 + P(y,k,Dm-' (A.40) 
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where P is some polynomial depending on /b./l./i with deg{P){y) = 6 the exact expression of which 
may be drawn from the MAPLE-script. Accordingly, we define J(„j_k '■= ^nj.kO'n, and, with tp the density of 
N(0, 1 ), use the abbreviations 

ip{t) = tfoyou(t), h(t,J,k,n) = h(y o u(t), J,k,n) (A.41) 

We also introduce the integration domains 

A„,j,t = {f e R 1 1 ^ - F(t)\ < k2 ), A„,j,t = {\t\< h ^/^(l + o(nO))//o) (A.42) 

Finally, applying )A. 12) and jA.36) , we derive an integration constant c„ jk from ynj,k from jA.24) : 

Cn.i.k ■■= 2-im-^2y„ jj. = 1 - i/(4m'/^) + (16/ - I6jk + 3P + 12)/(32m) (A.43) 
Plugging this all together, we obtain 

r nt^g„,,t{t)dt = (c„,j_k + o(^)) f 2im^'^ ff{t)^{t)h{tj,k,n)dt 
Substituting t{y) = , we get 

nt^g„,j,k(t)dt = j c„,;.t(l + f + f^tiyf + o(n-'))ip(y)h{y,]Xn)^^^^^ Ii„,., {t{y))dy 
As i„j,t = 0(«"), 

[\y + x„j,k\ < 2k2 VlogWd + o(n°))} = [\y\ < 2k2 y/log{n)(l + o(n"))) =: A° 
For the aggregation of the factors we use MAPLE, giving 



4/o 



¥>(>•) rf>' (A.44) 



for polynomials in y, Pi.„ii and ^"211/* obtained by our MAPLE-procedure getasintegrand, where 



Pj.„ j{ is defined as 



, V2//i(2-/) / V2vKy^ + l) .yVi(3-V) \ - / (3+.y^)^ (4-5 / ) V2 /, >■ \ j2 _ A/ r3 _ /l V2 v -4 
l,fl + 32/4 +^ ,f2 + 8^4 8/„2 + 4/4^ 4/4 ^ 



and ^2;,,./t as 



p ..(-.A ^ t\v'i-2Y4-y^)+/(7-4/) /|t V2Y3(-2+5j^-y'') /2/(3-/) /f/lr-S) 
2-.nJ,k^> ,28/2 128^4 ,52^^ 256/„<' 

, ^ V2(F><9+6/+3/)+28>-(3-/)) t /i .y^ (-2.y-'+5y2+3) VlyVzdZ-S/) , VZ yVi'(3.y--l3) _ 



mfl 32/4 192/5 128/6 



/ F(45+18>-2+3).4)-100/-24/+84 /i I V2:y(12-/-5/) //2(9-5y-) /i-y2(13y4-41y2-12) -i -2 
I 192/2 + + 48/5 + ,28/J + 

/ V2y(3-5y2) _ kfj/ , /2 V2.y(IO-9y2) /j' V2>-(3/-5/-4) ^ 3 /2(l-3/) /i2(2y4-l) \ ^ 

M 12/? 16/4 + 56/^5 + 32/6 }] +\if„2+ 48/jj5 + 32/^6 }J 

By the restriction in Ajj, we obtain that |y| = 0( yj\og(n)), while Asg(P\ .\y) = 5 and disg(P2-.',y) = 8. 
Hence, the integrand is apparently of form (y + V2 J)^/(4/o^) + 0{^\og{n)^ /n), and thus, eventually in n, 
is maximized — up to 0( yj\og(n)^ jn ) — for |J| maximal, i.e. |7| = i/2. Even more so, if f\ = 0, the nr^l~- 
term, too, is maximized for \j\ = k/2. As the highest power in P2;. occurring to y in a /-term without /i is 
4, the integrand is maximized up to 0((log(ii)^/n)) for \j\ = k/2. 

Condition |J| = k/2 is equivalent to j;,(t) = kov jk(t) = 0. But this is the case — up to o(«"') — if condition 
031 or )3.6) is in force, as then up to mass of order o(n"') the contamination is either concentrated left 
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of - ^ login) /n or right of ^ -^log{n)/n for any sample with no more than k[i- ^Jn contaminations. With 
respect to jA.31( , this suffices to obtain that (II) is o(n"'). 

Later, after having integrated out we will see that if /i =0, the approximation up to order ' is identical 
for jk(t) = k and jk{t) = 0, whereas if fi > it pays off for nature to contaminate by positive values and, 
correspondingly, by negative values if /i < 0. We consider jii(t) = k here. 
Up to o(ji"'), J^^^^^nt^g„xk(t)dt is 

r [ 772 (>•' + + Si-,, i(y)m-'/2 + Qj.^ j(>.)m-' + g{n, k,y)]^(y) dy (A.45) 
JaS 'o 

with some skew-symmetric polynomial g in v of degree 5 ffiat is uniformly bounded in n on Ajj, and 
for some even-symmetric polynomials Ql;«,t(.>') and 22;/i,/t()'); we only present the definition of Qi;„,a(.v) 
below; for Q2;ii,k(y), we refer the reader to the corresponding MAPLE-procedure getasrisk. 

O (v) - 3P I ( hCt'-m P-2k\ 2 .Ik , fA.A 



Using Lemma A. 6 we see that we may drop the restriction |v| < Iki ^Jlog{n) and integrating v out, up to 
o(n"'), we get that ^^^^^^np-gnxk(t)dt is 

+^6/2 + 16/4 32y^5 + ,28^*^" +^2/„= 384/5 + 54^4 j.j^^'' J'« 

Corollai'y |A.3| gives that we may ignore the fact that k is restricted to <: < kir^Jn and so with Lemma p^.5| 
we may simply integrate out k. After substituting n = 2m + 1 we thus indeed get 



sup n [MSE(Med,„ G<'"*)] = -^((1 + r^) + -5=(2(i+.-) + . 
G(") 4/^-= ( v« 2;„ 

Considering both cases jk(t) = k and /'<:(?) = simultaneously, we get {3.2) with {3.3) and l |3.4^ . 



A.7 Proof of Proposition|3.4| — ^pure quantiles and randomization 



3.2| we calculate the bias of V"^[(m+l):n] under F. This gives |Bias(X[,„+i:„], = B„ [ + 

L_ IB I- i-^'i 

2/0 V^' "■ 8/3 



The proof for the pure quantiles is just as in the odd case and thus skipped. We only draw the attention 
to the dift'erent behaviour of the 1 / Vn-coirection term for positive and negative contamination which 
explains 3.17t in this case. For the bias corrected version M,'/, with the same techniques as in the proof of 
Theorem . 
S„_2 for 

B„ = B„,i + B„2, B„,i = \B„2\ = (A.46) 

.'0 

The same terms but with dift'erent signs are obtained for Vn |Bias(X[„,:„], F")|. We only consider B„ i here, 
which arises no matter if we have symmetry or not and gives the bias corrected version M',' with the a, y 
terms as in Proposition |3.5| 

Remark A.7 We note that in all variants of the sample median up to now a minor deterministic improve- 
ment is possible if fi # 0, when we consider the bias-corrected estimators 

M,'; := M„ - -^B„2 = M„ + (A.47) 
Except for the pure quantiles for even n, this renders all variants bias-free up to o(j!"') in the ideal model. 
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A.8 Proof of Proposition [34] — the midpoint-estimator 

For the midpoint-estimator M„, we need the common law of the pure quantile estimators X[„, „] and 
X[(ra+i):n]- So more generally, we start with the common law of (Y,Z) := {X[y^„],X[y^-„]) for 1 < vi < 



V2 < n, Xi '~' F, i = and F(dx) = f(x)dx, see David 1970 pp. 9-10), and in our case (n=2m. 



V|=m, V2=m + 1) leads us to the density of the midpoint estimator M2,,, = {Y + Z)/2 

g„{t) = (2mf iy""' ' j J [F(2f - u)(l - F(u))f"'-'^ f{u) f(2t - u) du (A.48) 

This gives for (2m)MSE(M2,„, F), after substituting s = 2t - u, and using Fubini 
(2m)MSE(M2m,f) = 

^'"^ ^""m 1 / n [^^'^^^ " Fiu))r-''f(u)mdsdu (A.49) 

We skip the argument showing how to choose a risk maximizing contamination. In the MAPLE script, 
however, we have detailed out a corresponding argument for j{f) the number of contaminated observations 
larger than t. Without loss of generality, we work with the case of contamination to the right. Analogue 
arguments as in the preceding cases show that given we have k observations contaminated to 00, we get as 
expression for the (conditional) MSE| j5-=t : 



(2m)MSE\K=k 



(2m)(2m - k) (^'""'^"*) J f (»)<"'-*>(! - Fd,))*""-" /(«) X 



(A.50) 



which we have written in a way to be able parallel the preceding subsections. Denote the value of the inner 
integral by Hk(u) and 

A(s, u) := (F(u) - F(s))/F(u) (A.51) 

In the inner integral, < A(s, w) < 1, and for zl(.v, u) > a > 0, Hk(u) will decay exponentially while being 
dominated, so if we introduce 

S(u) := supji < u\F{s) < (1 - a)F(u)] (A.52) 
in fact we may restrict the inner integral to 

//t(«) = o(m-')+^^ r (u + sf{l-A(s,u)f-^-''^f{s)ds (A.53) 

But then expanding log(l - A(s,u)), and in order to get the right order for the expansion substituting 
u = ul '\Jm, s = s/ '\Jm — according to case (I), i.e.; \u\ < const ^log(m)/m. Thus, for polynomials Qt in 
J, u defined in analogy to the Qi in the to odd-sample case and with may be looked up in the MAPLE script, 

A(s, u) = 2 + + Sip + e^M) + 0((^ fl^) 

Hence we get 

(m-/t- l)log(l -zI(.s,h))- Vfn(2/o(i-5)) = logH21(«,M) + o( Vlog(n)/«) (A.54) 

for some function logH21, the exact expression of which may be produced in the corresponding MAPLE 
script. Thus, denoting the term exp(2 Vm /o (S - «)) by H i(s, u), we get 

(I -zl(.s,M))<"'"*"" = //;i(4-,«)exp(logH21) X (1 + o( ^\og(n)ln)). 

Now, if we write H-2,i(s,u) for (.v + u)^ f(s), and Ht,2,2U,u) for exp(logH21), and if we introduce 
Ht,2(s, u) := H-2,i(s, u)Hk-2,l(s, u), we get 



4F(u) Hk(u) = o(n-^) + H ■x{s,u)Ht2(s,u)ds 

J5(u) 
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The next step is to integrate out i where we may drop the lower restriction again due to the exponential 
decay far out for large values of s. After three times of integration by parts we come up with 



4Fiu)Hk{u) = o(n-'-) + J] 



(-1)' d' |. 

-H-i(s, u)—Hk-2{s,u)\ 



(A.55) 



that is we may restrict ourselves to these terms for our purposes. These differentiations can be done by the 
MAPLE command di£f . Noting that essentially ? = 0( ^Jlog(n)/n), we hence get for the inner integral H 

So in formula jA. 10) (with j = k) we replace by Hii(t} and arrive at 



sup«MSE(M,„G*"') 

0<"> 



111 ^ 



Ht{t)gn,k.k{t)dt P(K = k) + o(n-') 



(A.56) 



Proceeding now just as in the preceding subsections, we obtain the assertion. 



A.9 Proof of Proposition 3.9 



For t > Vlog(n)/«/(2/o), let 



At., := j J] f/,(2I(X, </)-!) <*:-!} 



(A.57) 



Hence if t > ^\og{n)/n/{2fo), by jX20) , for all /t > (1 - S)r y/n, 

Pr(Ai.., \K = k)>po 



(A.58) 



Now we proceed as in the proof to Theorem 3.2 But t > ^log(n)/n/(2/o) <=^ y > ^log n in jA.45) . 
Hence on the event Aj , for y e [ ^log n; -y/log n ), we get the bound J{t) < (k - \)l yfn, while for 
y £ (-fo -\/log(n); yjlogn ) respectively on ^A^.r, we bound J(/) by /:/ V". Integrating out these two y- 
domains separately, we obtain 

n (MSE[Med„, g[,"' |^ = /t ] - MSE[Med,„ G*"' = /t]) > 



/-»K2 Viog /' 

I [sl^r + kl^r-\ / {2 fori)) if(s) ds + o(n" ' ) 

A/log » 



2/0 Jyiog 



But for < a\ < a2 < 00, ip{ai )lai - (fi{a2)/a2 < j^^ <p(s) ds, so that with ai = 2 -y/log n, 02 = ^2 -\/log n , 
and as 40(02) = oC" ' ). 



(i(MSE[Med„,G|,"' = MSE[Med„ , G*"* \K = k]) > 



P° +o(„-') 



2V2^n/o 



By Lemma |A.2| the restriction to (1 - 6)r Vn < K < k[r 'Jn may be dropped, and we obtain the assertion. 
The case of an even sample size is proved similarly. □ 



Extra Material 



On [site of the journal] we have additional supplementary material for this article: This comprises extended 
tables, details to points alluded to in remarks, but in particular a MAPLE script, referred to in the proofs. 
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Fig. 2 A horrifying example: (The first two pages of) the expression f or jA.44) got from MAPLE; of course, 
after integration terms get much more treatable, as visible in Theorem|3.2| 
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